
    Sociocultural knowledge is needed for selection of shots in hate speech detection tasks

    We introduce HATELEXICON, a lexicon of slurs and targets of hate speech for the countries of Brazil, Germany, India and Kenya, to aid training and interpretability of models. We demonstrate how our lexicon can be used to interpret model predictions, showing that models developed to classify extreme speech rely heavily on target words when making predictions. Further, we propose a method to aid shot selection for training in low-resource settings via HATELEXICON. In few-shot learning, the selection of shots is of paramount importance to model performance. In our work, we simulate a few-shot setting for German and Hindi, using HASOC data for training and the Multilingual HateCheck (MHC) as a benchmark. We show that selecting shots based on our lexicon leads to models performing better on MHC than models trained on shots sampled randomly. Thus, when given only a few training examples, using our lexicon to select shots containing more sociocultural information leads to better few-shot performance.
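
    The abstract above describes choosing few-shot training examples by how much sociocultural lexicon content they carry. Below is a minimal, hypothetical Python sketch of that idea: the lexicon format, the hit-counting heuristic, and the function names are illustrative assumptions, not the authors' HATELEXICON implementation.

        # Illustrative sketch of lexicon-guided shot selection
        # (assumed heuristic; not the authors' HATELEXICON code).
        from typing import List, Set, Tuple

        def lexicon_hits(text: str, lexicon: Set[str]) -> int:
            """Count how many lexicon entries (slurs, target terms) occur in a passage."""
            tokens = text.lower().split()
            return sum(token in lexicon for token in tokens)

        def select_shots(candidates: List[Tuple[str, int]],
                         lexicon: Set[str],
                         k: int = 16) -> List[Tuple[str, int]]:
            """Pick the k (passage, label) pairs with the most lexicon hits,
            instead of sampling shots uniformly at random."""
            ranked = sorted(candidates,
                            key=lambda example: lexicon_hits(example[0], lexicon),
                            reverse=True)
            return ranked[:k]

        # Hypothetical usage with placeholder data standing in for HASOC passages.
        lexicon = {"slur_a", "target_b"}
        pool = [("a passage mentioning slur_a", 1), ("an unrelated passage", 0)]
        shots = select_shots(pool, lexicon, k=1)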

    Ethical scaling for content moderation: Extreme speech and the (in)significance of artificial intelligence

    In this article, we present new empirical evidence to demonstrate the severe limitations of existing machine learning content moderation methods to keep pace with, let alone stay ahead of, hateful language online. Building on the collaborative coding project “AI4Dignity”, we outline the ambiguities and complexities of annotating problematic text in AI-assisted moderation systems. We diagnose the shortcomings of the content moderation and natural language processing approach as emerging from a broader epistemological trapping wrapped in the liberal-modern idea of “the human”. Presenting a decolonial critique of the “human vs machine” conundrum and drawing attention to the structuring effects of coloniality on extreme speech, we propose “ethical scaling” to highlight the moderation process as political praxis. As a normative framework for platform governance, ethical scaling calls for a transparent, reflexive, and replicable process of iteration for content moderation with community participation and global parity, which should evolve in conjunction with addressing algorithmic amplification of divisive content and resource allocation for content moderation.

    This joke is [MASK]: Recognizing Humor and Offense with Prompting

    Humor is a magnetic component in everyday human interactions and communications. Computationally modeling humor enables NLP systems to entertain and engage with users. We investigate the effectiveness of prompting, a new transfer learning paradigm for NLP, for humor recognition. We show that prompting performs similarly to finetuning when numerous annotations are available, but gives stellar performance in low-resource humor recognition. The relationship between humor and offense is also inspected by applying influence functions to prompting; we show that models could rely on offense to determine humor during transfer.
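
    Since the abstract sketches a cloze-style prompt ("This joke is [MASK]"), here is a minimal illustration of that prompting pattern using a BERT-style masked language model via the Hugging Face fill-mask pipeline. The model choice, prompt template, and verbalizer words ("funny" / "boring") are assumptions for demonstration, not the paper's configuration.

        # Illustrative cloze-style prompting for humor recognition
        # (assumed template, verbalizers, and model; not the paper's setup).
        from transformers import pipeline

        unmasker = pipeline("fill-mask", model="bert-base-uncased")

        def humor_score(joke: str) -> float:
            """Ask the MLM to fill 'This joke is [MASK].' and compare the
            probabilities of a humor verbalizer vs. a non-humor verbalizer."""
            prompt = f"{joke} This joke is [MASK]."
            predictions = unmasker(prompt, targets=["funny", "boring"])
            scores = {p["token_str"].strip(): p["score"] for p in predictions}
            return scores.get("funny", 0.0) - scores.get("boring", 0.0)

        print(humor_score("I told my computer a joke about UDP, but I am not sure it got it."))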

    Listening to Affected Communities to Define Extreme Speech: Dataset and Experiments

    Building on current work on multilingual hate speech (e.g., Ousidhoum et al. (2019)) and hate speech reduction (e.g., Sap et al. (2020)), we present XTREMESPEECH, a new hate speech dataset containing 20,297 social media passages from Brazil, Germany, India and Kenya. The key novelty is that we directly involve the affected communities in collecting and annotating the data – as opposed to giving companies and governments control over defining and combatting hate speech. This inclusive approach results in datasets more representative of actually occurring online speech and is likely to facilitate the removal of the social media content that marginalized communities view as causing the most harm. Based on XTREMESPEECH, we establish novel tasks with accompanying baselines, provide evidence that cross-country training is generally not feasible due to cultural differences between countries, and perform an interpretability analysis of BERT’s predictions.

    Wine is Not v i n. On the Compatibility of Tokenizations Across Languages

    The size of the vocabulary is a central design choice in large pretrained language models, with respect to both performance and memory requirements. Typically, subword tokenization algorithms such as byte pair encoding and WordPiece are used. In this work, we investigate the compatibility of tokenizations for multilingual static and contextualized embedding spaces and propose a measure that reflects the compatibility of tokenizations across languages. Our goal is to prevent incompatible tokenizations, e.g., “wine” (word-level) in English vs. “v i n” (character-level) in French, which make it hard to learn good multilingual semantic representations. We show that our compatibility measure allows the system designer to create vocabularies across languages that are compatible – a desideratum that so far has been neglected in multilingual models.
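
    The measure proposed in the paper is not spelled out in the abstract, so the sketch below only illustrates the underlying intuition with an assumed proxy: for translation-equivalent words, compare how many subword pieces each language's tokenizer produces, so that "wine" (one piece) vs. "v i n" (three pieces) registers as incompatible. The function name and the averaging scheme are assumptions, not the paper's definition.

        # Illustrative proxy for cross-lingual tokenization compatibility
        # (assumed metric; not the measure defined in the paper).
        from typing import Callable, List, Tuple

        def granularity_gap(pairs: List[Tuple[str, str]],
                            tokenize_src: Callable[[str], List[str]],
                            tokenize_tgt: Callable[[str], List[str]]) -> float:
            """Average absolute difference in subword counts over aligned word pairs;
            0.0 means both tokenizers segment equivalents equally coarsely."""
            gaps = [abs(len(tokenize_src(s)) - len(tokenize_tgt(t))) for s, t in pairs]
            return sum(gaps) / len(gaps)

        # Hypothetical tokenizers: English keeps "wine" whole, French falls back to characters.
        en_tok = lambda word: [word]          # word-level tokenization
        fr_tok = lambda word: list(word)      # character-level fallback
        print(granularity_gap([("wine", "vin")], en_tok, fr_tok))  # 2.0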